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(57) ABSTRACT 

A method and apparatus for restoring communications in a 
network. The network includes a plurality of nodes, with 
each pair of nodes connected by a link, and with each link 
having information channels and restoration channels. An 
idle signal is sent on each restoration channel for each link. 
The failure of a link is detected, with the failed link 
connecting an originating node with a terminating node. In 
addition, the failed link includes at least one information 
channel carrying information signals. An alternate path 
through the network is determined for the information 
signals using restoration and idle signals sent over the 
restoration channels. The information signals are then routed 
from the originating node to the terminating node in accor- 
dance with the alternate path. 
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METHOD AND APPARATUS FOR originating node and destination node of the individual 

RESTORING A NETWORK circuits in the failed link. By way of contrast, link based 

restoration attempts to re-route all traffic around the failed 

This application is a continuation of application Ser. No. hnk regardless of the origination and destination of the 

08/936,369, now U.S. Pat. No. 6,130,876 filed Sep. 24, 5 bearer traffic on the failed hnk. 

1997 Link based restoration and path based restoration each 

have their advantages and disadvantages. For example, link 

FIELD OF THE INVENTION based restoration is typically faster than path based 

_ . t . , t t . , . t - » u * * restoration, but is less efficient in terms of restoration 

The invention relates to network restoration techmquesm h utilization. Conversely, path based restoration is 

general. More particularly, the invention relates to a method s]owef ^ ]mk base(J res , orali but utilizes restoration 

and apparatus for restoring network communications when resQurces mQre effidenl , ^ ^ ori ^ Dation ^ destina . 

a node or link of the network fails. , ion n(jdes of ^ faikd QQdes m typicaUy ^touted 

BACKGROUND OF THE INVENTION throughout the system. 

, .... 15 These techniques, however, are unsatisfactory for a num- 
It is a fairly accepted truism that communications is the ber of reasons For e le> a mm?kt( . ly optical laycr 
hfcblood of business. As domestic and international busi- netwofk aboye the Synchronous Optical Network (SONET) 
nesses continue to expand at an extrodinary pace, these h ^ fast becoming a rea]ity . ^ optical network is 5eing 
businesses become increasingly reliant upon telecommuni- drfven both by ^ commercial availability of dense wave- 
cations services to remain competitive in a global market- m j fa mllltiplex (qwdm) technology and the 
place. Whether it is talking to a customer over the Public contimling growth of traffic. 

Switched Telephone Network (PSTN), sending an electronic DWQM ^ ^ q Qr mQre 

mail message over the Internet or trading product specifi- Q(M8 Qn a ^ of flben . fo ^ ^ ft QW 

cations over a local area network, disruptions to a commu- . u j j i +u a *u t i 

. , , . ' f i to more than one hundred wavelengths, and the channel 

mcations network can mean significant losses to a business M ma tQ ^ ^ w Q{ b 

Extended outages are particularly harmful, but even brief £ s) * wh / n , substantial mmber of d m 

outages can be bothersome, lne result is ever increasing t . ' t L t . . 

, & , / r . ^ „ & the network, it will be necessary to manage the network at 

demands by telecommunications customers tor a virtually , u , , . * -n • #u u*i 

. ^ , me optical layer. This management will require the cap abil- 

unmterruptible network. ity lQ restofe tfae netvvork in tne optical layer Networking 

One element to creating a virtually uninterruptible net- 30 and rcstoration at the optical layer ^ highly desirable for 

work is to correct network outages as rapidly as they occur. optjcal sw i tc hin g systems. No signal will undergo optical to 

At a very high level, a network can be viewed as a pattern c i ec trical conversion at these optical cross-connect systems, 

of communications nodes interconnected by commumca- Therefore, restoration from a failure in the network will 

tions links. The communications nodes can include elec- either require communication and processing messages 

tronic or optical cross-connects ("switches"), personal 35 Detween the opucal cross-connect systems in the optical 

computers, servers, printers, or any other type of network domain or ^ auxiliary optical channel which will undergo 

device. The communications links include some type of Qptical to electrical conversion and processing just for 

media for transporting communications signals, such as messaging. It is desirable to eliminate the need of an 

optical fiber, twisted-pair copper wires, co-axial cable, radio auxi i iary channel for the purpose of restoration. Even if it is 

frequencies, and so forth. An example of a communications 4Q reqiJ i r ed for other purposes, it is extremely important that the 

network would be a set of communications switches pr0 cessing required at each node remains simple for imple- 

("switching fabric") connected together by optical fibers m enting a fast restoration technique in an optical network, 

("optical links"). If an optical link is damaged, as frequently Conventional network restoration techniques fail to address 

occurs such as when a construction company digs m the area any of these ^cerus, and are not designed to perform 

where the optical link is buned, the communications signals 45 network restoration in the optical domain, 

carried by the optical link must be quickly re-routed This is {q q{ ^ f { it caD be apprec iated that a 

also tme if a switching fabric becomes inoperable, although substantial needs exists for a method and appartus for 

the problem of re-routing the communications signals idin fast from a link or a node failure in a 

becomes an even greater challenge in this case. network> ^ golyes me above . discussed pro blems. 

Several conventional techniques have been developed to 50 

restore communications in the event of a link or node failure SUMMARY OF THE INVENTION 

on a network. These techniques are loosely referred to as The present invention includes a method and apparatus 

"network restoration techniques," and in most cases refer to for restoring communications in a network. The network 

an algorithm for re-routing the communications signals includes a plurality of nodes, with each pair of nodes 

carried by the failed link, or switched by the inoperative 55 connected by a link, and with each link having information 

node. For example, a class of algorithms have been devel- channels and restoration channels. An idle signal is sent on 

oped that are referred to as "flooding algorithms " Commu- each restoration channel for each link. The failure of a hnk 

nication messages for service restoration in case of a failure is detected, with the failed link connecting an originating 

in the network are transmitted through links between the node with a terminating node. In addition, the failed hnk 

switches. The switches then electronically process these $0 includes at least one information channel carrying informa- 

messages to take appropriate action to restore the failed tion signals. An alternate path through the network is deter- 

trafSc in the event of, for example, a link failure. mined for the information signals using restoration and idle 

There are basically two types of flooding algorithms for signals sent over the restoration channels. The information 

restoring the failed traffic in the event of a link failure. The signals are then routed from the originating node to the 

first is referred to as "link based restoration," while the 65 terminating node in accordance with the alternate path, 

second is referred to as "path-based restoration. Path based With these and other advantages and features of the 

restoration attempts to re-route failed circuits between the invention that will become hereinafter apparent, the nature 
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of the invention may be more clearly understood by refer- 
ence to the following detailed description of the invention, 
the appended claims and to the several drawings attached 
herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates a network suitable for practicing one 
embodiment of the present invention. 

FIG. 2 illustrates an optical cross-connect system suitable 
for practicing one embodiment of the invention. 

FIG. 3 illustrates a network having a link failure in 
accordance with one embodiment of the invention. 

FIG. 4 is a block flow diagram of the steps for restoring 
a network in the event of a link failure in accordance with 
one embodiment of the invention. 

FIG. 5 illustrates a network having a node failure in 
accordance with one embodiment of the invention. 

FIG. 6 is a block flow diagram of the steps for restoring 
a network in the event of a node failure in accordance with 
one embodiment of the invention. 

FIG. 7 is a block flow diagram of the steps for restoring 
a network in the event of a node failure in accordance with 
another embodiment of the invention. 

FIG. 8 illustrates a block diagram of a detecting circuit in 
accordance with one embodiment of the invention. 

DETAILED DESCRIPTION 

The present invention includes a method and apparatus 
for optical network restoration. One embodiment of the 
invention is link based and comprises an automatically 
computed shortest route restoration algorithm. It does not 
require a sender and chooser node identification or band 
width requirement messages. It works from both ends simul- 
taneously and therefore, the restoration is fast for link 
failure. This embodiment of the invention is also applicable 
for a node failure in the network. It includes an in-band 
restoration algorithm which does not require any extra 
channel or any overhead (as in SONET^ and therefore is 
optimal for an optical network restoration. The in-band 
messaging and restoration method achieves fast restoration 
comparable to a SONET ring restoration and yet with less 
network restoration capacity. 

Referring now in detail to the drawings wherein like parts 
are designated by like reference numerals throughout, there 
is illustrated in F[G. 1 a network suitable for practicing one 
embodiment of the present invention. FIG. 1 shows an 
optical network 8 comprising multiple nodes labeled 1-5 
connected by DWM optical links. It is assumed that optical 
network 8 is OC-48 based, i.e., each operating channel in 
each DWDM optical link is carrying an OC-48 signal. In this 
embodiment of the invention, there is an optical cross- 
connect system in each node. It can be appreciated, however, 
that electrical cross-connects can be used and still fall within 
the scope of the invention. In such a case, each node would 
be equipped with electrical to optical (O/E) and E/O 
converters, and the appropriate signaling across said O/E 
and E/O converters will have to be implemented in accor- 
dance with the principles described herein, 

FIG. 2 illustrates an optical cross-connect system suitable 
for practicing one embodiment of the invention. The cross- 
connect system includes an optical switch fabric 30, 
multiplexers/demultiplexers (MUX) 32, a central processing 
unit (CPU) 34, and memory 36. Memory 36 further includes 
a network restoration module (NRM) 38. 

Optical switch fabric 30 performs switching functions by 
routing a signal from one of its input port to one of its output 
port. This routing is accomplished regardless of the rate of 
the signal. 
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MUX 32 performs wavelength adaptation and 
multiplexing/demultiplexing functions. With wavelength 
adaptation and multiplexing, a large number of OC-48 (or 
any other rate) signals can be transported over a single fiber. 
S Similarly, the reverse function, wavelength de-multiplexing 
and adaptation, are performed in the receive direction before 
the OC-48 signals interface with the optical cross-connect 
system. 

It is worthy to note that in this embodiment of the 

io invention, the wavelength adaptation and multiplexing or 
de-multiplexing functions are shown outside the cross- 
connect system. It can be appreciated, however, that these 
functions can be implemented in the photonic cross-connect 
system itself and still fall within the scope of the invention. 

15 Optical switch fabric 30 is connected to CPU 34. In this 
embodiment of the invention, CPU 34 is a dedicated pro- 
cessor for optical switch 30, but also may be an external 
processor. In any event, CPU 34 must have sufficient pro- 
cessing power to perform the functionality for NRM 38. 

20 CPU 34 operates under the control of executed computer 
program instructions that are stored in memory 36. In this 
embodiment of the invention, NRM 38 is in the form of 
computer program instructions stored in memory 36. It can 
be appreciated, however, that the functionality for NRM 38 

25 may be implemented in hardware, software, or a combina- 
tion of hardware and software, using well-known signal 
processing techniques. 

Memory 36 may be any type of machine readable storage 

3Q device, such as random access memory (RAM), read only 
memory (ROM), programmable read only memory 
(PROM), erasable programmable read only memory 
(EPROM), electronically erasable programmable read only 
memory (EEPROM), magnetic storage media (i.e., a mag- 

35 netic disk), or optical storage media (i.e., a CD-ROM). 
Further, the cross-connect system may contain various com- 
binations of machine readable storage devices accessible by 
CPU 38, and which are capable of storing a combination of 
computer program instructions and data. 

40 Referring again to FIG. 1, in optical network 8 each pair 
of nodes is connected by one or more optical links, each link 
having a pair of fibers for two-way transmission. Although 
each link is a two-way link, it is not necessary to assume that 
the traffic is also two-way for the restoration to work. 

45 Optical network 8 is designed in such a way that a 
majority of the channels in each link carry live traffic and a 
small number of channels are left vacant to be used for 
restoration in case of a network failure. To meet a given 
objective for restoration against failures, a network must be 

50 designed properly with respect to the topology and per link 
usage ratio. Issues associated with the design of such a 
network has been studied extensively for electronic cross- 
connect based networks. The same principles can be applied 
to design an optical network with desired restoration prop- 

55 erties in the optical domain. 

This embodiment of the invention utilizes three different 
types of signals that can be transmitted on the optical 
channels: (1) a service traffic signal (TS); (2) an idle signal 
(IS); and (3) a restoration signal (RS). The IS is transmitted 

60 over the channels not carrying service and are reserved for 
restoration in case of a failure in the network. The RS is 
transmitted over the restoration channels when they are to be 
used for service restoration in case of a failure in the 
network. 

65 The IS and RS may be any bit pattern, but must be simple 
enough for the cross-connects to detect, process and insert 
quickly. In this embodiment of the invention, a repeating 
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sequence of "101010 ..." bit pattern is used as the IS. The 
RS is a repeated bit sequence indicating the originating node 
number associated with a failed link. For other types of 
restoration, however, a different RS may be necessary. In 
any event, for any type of restoration, both IS and RS are of 5 
an arbitrarily low bit rate and independent of the traffic bit 
rate. Therefore the capability of timing recovery and detec- 
tion of IS and RS messages is implemented in the optical 
domain without the need for optical to electrical conversion, 
or in the electronic domain with simple circuitry. For 1Q 
example, even though the service channels may carry OC-48 
signals which need to be restored on the idle channels, the 
restoration channels can use a bit rate of only 10 Mb/s or 
lower for IS and RS. 

It is worthy to note that although this embodiment of the ^ 
invention is illustrated using an optical network, it can be 
appreciated that this embodiment of the invention is equally 
applicable to a general network regardless of the types of 
links and nodes in a network or the types of traffic carried on 
the links. An advantageous embodiment of the invention, 
however, provides the most benefit for optical network 
restoration. 

NRM 38 provides network restoration in the event of a 
link failure or a node failure. If there is a link failure, it is 
assumed that there is a single link failure, since the prob- 25 
ability of another independent link failure in optical network 
8 within the restoration period of the first link (sub second) 
is small. If there is a node failure, however, all links 
connected to the failed node fail simultaneously. 

Since the probability of a node failure is much lower than 30 
that of a link failure, NRM 38 first assumes that any failure 
is a result of a link failure and attempts to restore accord- 
ingly. That would result in fast restoration, if indeed it is a 
link failure. If the failure, on the other hand, is due to a node 
failure, it will take a longer time to restore the failed 35 
channels. 

FIG. 3 illustrates a network having a link failure in 
accordance with one embodiment of the invention. As 
shown in FIG. 3, each node in optical network 8 is assigned 
a unique number with a predetermined maximum. For 40 
illustration purposes only, optical network 8 is assumed to 
have five nodes, numbered 1 through 5, as shown in FIGS. 
1 and 3. Each pair of nodes is connected by an optical link. 
Further, assume that each link has eight wavelength chan- 
nels. The channels left vacant for restoration are shown on 45 
each link. These channels carry IS in normal condition. FIG. 
3 shows a failure in link the link between nodes 1 and 5 
("link 1-5"). Note that as used herein links will be desig- 
nated by the nodes they connect, with the direction of signals 
being designated as originating at the first node and termi- 50 
nating at the second node. Thus for link 1-5, signals are 
originating from node 1 and terminating at node 5. For link 
5-1, signals are originating from node 5 and terminating at 
node 1. When link 1-5 fails, service is restored following the 
method described with reference to FIG. 4. 55 

FIG. 4 is a block flow diagram of the steps for restoring 
a network in the event of a link failure in accordance with 
one embodiment of the invention. At step 40, NRM 38 
performs initiation for the originating node. Upon detection 
of failure of one or more channels in one of its links, a node 60 
inserts a signal carrying a binary word indicating its own 
node identification number (NID) to all outgoing healthy 
restoration channels in all links. The node also sends the 
failure message to a Network Management System (NMS) 
(not shown). 65 

It is worthy to note that in one embodiment of the 
invention NRM 38 runs as individual processes in each 



node, and therefore does not require a cental controller. In 
another embodiment of the invention, NMS can be used as 
a cental controller to facilitate certain functions of NRM 38. 
In such a case, reference to a NMS will be made where 
appropriate. 

Intermediate node connection is performed at step 42. 
When an intermediate node, that is not connected to the 
failed link, detects a change from IS to RS in an incoming ' 
restoration channel, it connects the channel to one restora- 
tion channel in each outgoing link. If multiple channels in an 
incoming link change from IS to RS (as in the case of a link 
failure) then the maximum number of such channels are 
connected to the restoration channels in each outgoing link. 
Since there will be, in general, more than one other link, 
these will be broadcast type connections. 

At step 44, the loop release by the intermediate Node is 
performed. Subsequent to making connections according to 
step 42 in one direction, when an intermediate node receives 
another message signal from another link, the node checks 
if the originating node number in this message signal is 
identical to that in the first message. If so, the link is released 
of all connections and IS is inserted in each restoration 
channel on that link. 

An intermediate node reverse connection is made at step 
46. After an intermediate node has already made a connec- 
tion in one direction, when it receives a message signal from 
an idle channel of another of its incoming links, it checks if 
the originating node number is different from the one 
received during its first connection. If the originating node 
number is different, the intermediate node connects the 
maximum number of restoration channel carrying RS to the 
restoration channels in the outgoing direction on the link 
whose incoming channels were first connected. It is worthy 
to note that in another embodiment of the invention, the RS 
may carry both the originating and terminating node IDS. In 
that case each intermediate node matches both IDs in reverse 
order for two directions of transmission. 

The terminating node connection is performed at step 48. 
When a node associated with the link failure receives 
messages carrying another node ID on any incoming resto- 
ration channel, it disconnects the input ports from the 
outgoing channels on the failed link to these restoration 
channels. Upon completion of the connections, the termi- 
nating node inserts IS in place of RS in to all other 
restoration channels. The terminating nodes then inserts IS 
to the restored channels on the failed link so that the 
receiving node is alerted when the link is repaired. As a 
further confirmation, the terminating nodes may also check 
the originating node IDs in the incoming RS signals to 
ensure that they are indeed the nodes at the other ends of the 
failed link. 

The intermediate node releases all connections at step 50. 
When the signal in any restoration channel is changed from 
RS to IS, an intermediate node releases all connections to the 
incoming restoration channel and inserts IS in the outgoing 
disconnected restoration channels. 

NRM 38 terminates network restoration at step 52. Steps 
40 through 50 are repeated until one of the following four 
conditions is satisfied: (1) all failed channels are restored; 
(2) there is no more restoration channel on any outgoing 
link; (3) a predetermined time-out period expires and no RS 
is received in any incoming restoration channel; and (4) the 
node receives a command from the NMS to halt the process. 

At step 54, all operations of optical network 8 are returned 
to normal. When the failed link is repaired the nodes 
associated with the failed link receives IS from the repaired 
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link. Then the restored signals are connected to their original 
ports on the repaired link, the received signals are checked 
by the receiving nodes for the validity of the signals and then 
disconnected from the restoration channels. IS is then 
inserted on the disconnected restoration channels. Upon 
receiving the IS, the intermediate nodes remove the resto- 
ration connections and inserts IS to the outgoing restoration 
channels. The network then returns to its original state. 

If the restoration process is terminated before all failed 
channels are restored and yet spare channels are available, it 
is because either the failure is only in the incoming direction 
(single-ended failure) or it is a node failure or there is no 
spare capacity between the two nodes. If it is because of 
single-ended failure, the NMS can detect that from the 
failure messages sent by the nodes. Then the NMS can 
command the involved nodes to restore the channel in both 
directions and the two nodes can then restore the channel 
following the above rules. Similarly if it is due to node 
failure, the NMS can detect that from the failure messages 
from the adjacent nodes and command the involved nodes to 
begin restoration from a node failure which is discussed 
next. 

The operation of NRM 38 can be better illustrated using 
the following example, which makes reference to FIG. 3. As 
shown in FIG. 3, link 1-5 fails. Upon the failure of link 1-5, 
node 1 continuously transmits the binary word "00000001" 
to all outgoing idle channels: channels 6, 7 and 8 of link 1-3 
and channels 7 and 8 of link 1-2. Specific implementation 
method of the coding and decoding of the optical message 
signal RS will be discussed later. On detection of the failure 
of link 1-5, node 5 also performs the similar functions as 
node 1 independently. 

Upon detection of the change from IS to RS from chan- 
nels on links 1-3 and 1-2, nodes 2 and 3 read the originating 
NID (1 in this case) carried on the restoration channels (6, 
7 and 8 on link 1-3 and channels 7 and 8 on link 1-2), and 
connect the incoming restoration channel ports to the out- 
going restoration channel ports in all other links. Since there 
are multiple restoration channels in each link the channels 
may be connected sequentially. Thus node 2 connects chan- 
nels 7, 8 from link 1-2 to channels 7, 8 of link 2-3 and also 
to channels 7 and 8 of link 2-4. Similarly node 3 connects 
channels 6 and 7 of link 1-3 to channels 7 and 8 of link 3-4, 
to channels 7 and 8 of link 3-5 and to channels 7 and 8 of 
link 3-2. 

Within a short period of time after node 2 connects 
restoration channels from link 1-2 to all other outgoing links, 
it will receive from node 3 on link 3-2 RS with the NID as 
1 . Upon detecting that the restoration channels 7 and 8 from 
link 3-2 have the same originating node number as in 
channels 7, 8 on link 1-2, node 2 releases all connections 
(channels 7 and 8) to link 2-3 and changes the signal on these 
channels in link 2-3 to IS. Similarly node 3 releases con- 
nections to channels 7 and 8 of link 3-2 and inserts IS. 

After node 3 has connected the restoration channels from 
link 1-3 to link 3-5 and 3-4, at some point of time depending 
on the link lengths, it will receive either from node 4 or node 
5 RS signals on the restoration channels. Assuming that the 
signal propagation time on link 5-3 is shorter than that on 
links 5-4 and 4-3 combined, node 3 receives RS from node 
5 on the restoration channels 7 and 8 of link 5-3. Noting that 
the originating node number is now 5, node 3 connects the 
restoration channels 7 and 8 to the outgoing channels 6 and 
7 of link 3-1 . Note that node 3 does not connect channels 7 
and 8 from link 5-3 to link 3-2 because they are no longer 
connected in the 2-3 direction. 



25 
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Upon receiving RS from restoration channels 6 and 7 
from link 3-1, node 1 disconnects the input ports from 
channels 1 and 2 on link 1-5 and connects to channels 6 and 

7 on link 1-3. Subsequently, node 1 inserts IS in channels 7 
and 8 on link 1-2, in channel 8 of link 1-3 and in channels 
1, 2 in link 1-5. Node 5 performs similar functions after it 
receives RS from link 3-5. Thus channels 1 and 2 on the 
failed link 1-5 are restored on the path 1-3 (Channels 6 and 

7) and 3-5 (Channels 7 and 8). 

Node 2 receives IS (changed from RS) in channels 7 and 

8 on link 1-2. Node 2 then disconnects channels 7 and 8 on 
link 2-4 and inserts IS in these channels. This process 
continues to all the intermediate nodes and all the remaining 
restoration channels become free. 

Nodes 1 and 5 know that they still have to restore 
channels 3, 4 and 5 on link 1-5. It waits for a predetermined 
period, say 1 millisecond, and then follows steps 40 through 
50 to restore channel 3 on route 1-3 (using channel 8), 3-4 
(using channel 7), and 4-5 (using channel 7). In the subse- 
quent attempt, it restores channels 4 of link 1-5 on route 1-2 
(using channel 7), 2-4 (using channel 7), 4-5 (using channel 

8) . Then node 1 attempts to follow steps 40 through 50 again 
for the remaining channel 5. However, it never receives any 
RS from the incoming channel 8 on link 2-1 because no 
more restoration channel is available. Node 1 then termi- 
nates any further restoration attempt after a waiting period 
expires. Node 5 finds that there is no more outgoing resto- 
ration channel available in any link. It immediately termi- 
nates any further attempt for restoration. Both nodes 1 and 
5 communicate with the NMS that it could not restore 
channel 5 on link 1-5. The NMS attempts to restore service 
carried on channel 5 in link 1-5 at a lower layer such as 
SONET layer, Asynchronous Transfer Mode (ATM) layer or 
electronic cross-connect layer. SONET layer restoration can 
be either ring restoration or path based restoration. An 
overall network restoration approach, however, must con- 
sider many important issues: network configuration and 
cost, coordination of restoration among various layers, speed 
of restoration, and prioritization of restoration of different 
services/channels. 

Nodes 1 and 5 receive IS from channels on link 1-5 when 
it is repaired. On receiving IS, node 1 and node 5 bridge the 
corresponding input ports to the service channels 1 through 

4 on link 1-5. Nodes 1 and 5 then check for the validity of 
the received signals in channels 1 to 4 and then disconnect 
these signals from the restoration channels. It is worthy to 
note that channel 5 on link 1-5 was never disconnected 
because it was not restored in the optical layer. Nodes 1 and 

5 then insert IS to outgoing restoration channels in links 1-2, 
1-3 and 5-3, 5-4. The intermediate nodes 2, 3 and 4 discon- 
nect the restoration channels in the outgoing directions and 
insert IS. 

NRM 38 provides the means of SONET equivalent span 
protection switching in an optical line. If only one channel 
in a link fails, then NRM 38 restores the channel on the 
restoration channel in the same link provided that is the 
shortest link between the two nodes. With respect to FIG. 3, 
if any one or more (up to three) channels on link 1-5 fail then 
they will be restored on channels 6, 7, and 8, provided 1-5 
is the shortest path between nodes 1 and 5. 

FIG. 5 illustrates a network having a node failure in 
accordance with one embodiment of the invention. If there 
is a node failure in the network, the first attempt to restore 
a link as described previously will not be successful. When 
the time-out period is expired or a command is received 
from the NMS, NRM 38 begins the restoration from a node 
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failure. Note that unsuccessful link restoration attempt could 
have been due to two or more simultaneous and independent 
link failures. The probability of another completely inde- 
pendent link failure within the restoration time of the first 
failed link, however, is extremely small. Simultaneous link 5 
failures, on the other hand, is most likely to be associated 
with a node failure. Therefore, at the end of an unsuccessful 
link restoration attempt, NRM 38 assumes that the cause of 
the failure is a node failure and it begins a node restoration 
process. Alternatively, NRM 38 can wait for commands 10 
from the NMS to begin node restoration. The NMS, of 
course, can detect the node failure without any ambiguity. 
The steps performed for network restoration in the event of 
a node failure will be described with reference to FIG. 6. 

FIG. 6 is a block flow diagram of the steps performed for 15 
restoring a network after a node failure in accordance with 
one embodiment of the invention. Since a node failure 
causes multiple link failures, it is now necessary to select 
and restore the failed channels sequentially to avoid con- 
gestion during restoration by flooding method. For that 20 
purpose, a connection map of the network is maintained in 
memory 36 of each cross-connect system. When a node 
determines that the cause of the failure of its link to another 
node is the failure of the latter, it looks up the connection 
map at step 60. From the connection map, each associated 25 
node determines its rank relative to the others. For example, 
the ranks can be assigned with higher to lower as the NID 
increases. 

If the node is of highest rank, it begins restoration of the 
failed channels on the link to the failed node at step 62. None 30 
of the other nodes connected to the failed node originates 
any restoration attempt. 

To consider the restoration against node failure some 
aspects of the link restoration method need to be modified. 35 
To restore a single link failure, it was sufficient that only the 
originating node ID was transmitted in the RS signal. It is, 
however, desirable to include in the RS signal both the 
originating and the target NIDs for confirmation purposes. 
For restoration from node failure, on the other hand, it is ^ 
necessary that the RS signals carry both the originating and 
the terminating node IDs so that the optical paths can be 
restored around the failed node. 

The highest ranking node inserts RS to the restoration 
channels for all outgoing links at step 64, The highest 45 
ranking node selects a failed channel according to some 
order, which in this embodiment is on the basis of priority. 
The node then inserts an RS signal, which contains the 
originating NID (its own ID), terminating NID (the desti- 
nation NID for the selected channel), to the first restoration 50 
channel in each of the outgoing links. If there are additional 
restoration channels in any outgoing links it selects the next 
failed channel and inserts an RS (with its NID and the 
selected channel's destination NID) to the next restoration 
channel in any outgoing link. The process is continued until 55 
all failed channels or all outgoing restoration channels are 
exhausted. No attempt is made, however, to restore the failed 
channels which are terminated at the failed node. This can be 
accomplished by maintaining and consulting a table of 
terminating NIDs for each channel passing through the 60 
cross-connect system in its database. 

There is at least one major difference between the single 
link restoration (SLR) and the single node restoration (SNR) 
functions of NRM 38. Id the case of SLR, each node works 
independently and simultaneously to restore the failed link. 65 
This method leads to a faster restoration. In the case of SNR, 
however, it is necessary to restore the channels on a failed 
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link only from one node, namely, the higher ranking node to 
avoid congestion. When the receiving node (e.g., lower rank 
of the two) detects the RS signal from the higher ranking 
node, it restores the failed channel whose destination is the 
higher ranking node at step 66. 

After the highest ranking node either completes the res- 
toration of failed channels connected to itself or all outgoing 
restoration channels are exhausted at step 68, the next 
highest ranking node is selected at step 70 and begins the 
restoration of the channels on its failed link. When the 
second ranking node decides that all failed channels termi- 
nated at the highest ranking node are restored or a time-out 
period expires, it begins restoration of the channels ta are 
terminated to all the lower ranking nodes. Note that none of 
the nodes will begin any restoration until all the channels 
that are terminated at a higher ranking node are restored or 
a time-out period for the node expires. This process contin- 
ues until all nodes restore their failed channels. If channel 
restorations are not complete at step 68, steps 66 and 68 are 
repeated until they are complete. Similarly, if node restora- 
tions are not complete at step 72, steps 62 through 72 are 
repeated until they are also complete. 

Network restoration in the event of a node failure can be 
better illustrated using the following example, which makes 
reference to FIG. 5. Assume that highest to lowest ranking 
of the nodes are from 1 to 5. When node 3 fails, each node 
associated with the failed links attempt to restore the links. 
Of course, none will succeed because of no response from 
node 3. After a time-out period, say, 100 milliseconds, the 
involved nodes 1, 2, 4 and 5 independently assume that the 
other end of the links, i.e., node 3 has failed. Node 1 being 
the highest ranking node, begins restoration of the failed 
channels. Looking at the destination nodes of the failed 
channels, it restores the channels between itself and the 
destination nodes of the individual channels. Node 1, 
however, does not attempt to restore the channels whose 
final destination are node 3. In the meantime, nodes 2, 4 and 
5 do not attempt to restore any channels yet. Each of these 
nodes, looking at the network connection map in its own 
database, determines which are the other affected nodes due 
to the failure of node 3. For example, node 2 knows that 
node 3 has failed and nodes 1, 4 and 5 besides itself are 
connected to node 3. From this list it decides that its rank 
among the affected nodes is second after node 1. Therefore, 
it waits for a time-out period. This time-out period can be 
dependent on whether there is any failed channel between 
nodes 1 and 2. If there is a channel between nodes 1 and 2 
through node 3 that failed, then node 2 waits until all the 
failed channels between nodes 1 and 2 are restored or all 
restoration channels are exhausted. On the other hand, if 
there is no failed channel between nodes 1 and 2, then it can 
begin restoration of channels to other nodes immediately. 
Node 2 begins restoration of any failed channels between 
itself and the affected lower ranking nodes 4 and 5. Nodes 
4 and 5 still wait for the node 2 to complete its restoration 
attempt in a similar manner. This process is continued until 
the last but one node (node 4). Depending on the available 
restoration channels, some or all failed channels will be 
eventually restored. Note that the restoration from a node 
failure will take longer time (of the order of 0.5 to 1 second) 
because simultaneous restoration attempts between multiple 
pairs of nodes may interfere with each other leading to 
higher probability of failed attempts. The probability of 
congestion is minimi2ed in the proposed method at the 
expense of longer restoration time for the less likely case of 
a node failure. 

FIG. 7 is a block flow diagram of the steps performed for 
restoring a network in the event of a node failure in 
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accordance with another embodiment of the invention. In channels in each link used for normal traffic (1-f is the 

this embodiment of the invention, steps 62 through 72 are fraction of the channels in each link available for 

similar to the steps described with reference to FIG. 6. In restoration), KAverage link length, c=Speed of light in fiber, 

FIG. 7, however, the prioritization of nodes can be done by t -Processing time at each node which includes the time to 

the NMS at step 74 rather than by the nodes themselves. In s | etcct change in signal type and to set up or remove channel 

this case, the NMS will command the pairs of nodes to *• j * „ 7 ■ j . 

■ii * ! <• n t*u ktxxo * connections, and tw«Waitine period between two attempts 

restore specific channels sequentially. The NMS can receive .' & r * 

confirmation from the pairs of nodes before issuing another restoration. 

restoration command to avoid congestion in the restoration Using this equation, a typical restoration time can be 

process. 5Q estimated as follows. Assume that each link has 16 channels 

NRM 38 is applicable for restoration of a failed link or a of which 12 are used for normal traffic and 4 channels are 

failed node around the nodes adjacent to the failure location. available for restoration. All channels are restored in three 

This is essentially a link based restoration. NRM 38, attempts. Because of the simplicity of the RS signal, even 

however, can perform path based restoration which provides ^ a low bit rate such as 10 mi a processin g time of 0.5 

more efficient utilization of restoration capacity at the millisecond ^owld be sufficiem> Tf 8 bits are reserved for 

expense of restoration speed. The speed is compromised for JJ , . .... * . , m , 

several reasons: (1) the restoration cannot begin until the " ch . thc ^ginaUng node ID, terminating node ID, and 

path terminating node receives the path AIS (Alarm Indica- Naming, then within a period of 0.5 millisecond, there are 

tion Signal) from the nodes adjacent to the failure location; 208 frames to detect and confirm the simple repeated 

(2) more intermediate nodes are involved for a path resto- message. Another 0.5 millisecond should be sufficient for 

ration compared to link restoration; and (3) every pair of 20 connecting or disconnecting the channels. Therefore, 1 

nodes for each failed path will simultaneously attempt to millisecond for tp is a conservative estimate. An estimate of 

capture restoration capacity which may lead to conflict and 1 millisecond for tw is also conservative. With these 

its resolution may take longer time. Considering the conflict numbers, the restoration time for a link failure is conserva- 

resolution difficulties and restoration speed, link based res- tively estimated to be 68 milliseconds. Out of the 68 

toration is considered to be the more advantageous embodi- 25 milliseconds, the signal propagation time of 28 milliseconds 

ment of the invention even though the path based restoration ^ fixed. However, it is possible to reduce the processing 

is more capacity efficient. This determination, however, is t i me . jf j t ^ reduced to 0.5 millisecond, for example, the 

contingent upon the type of network and desired perfor- total restoration time is less than 50 milliseconds, 

mance parameters, and is in this respect application specific. ^ from ^ ^ ^ ^ 

The cross-connect systems must be able to detect the ^ u wm yary d nding on the connectivity of the 

change of state of each of the incoming channels. As network. If the degree of the failed node in a network is four 

mentioned earlier, significant simplification can be achieved Qr fi ft restoration time of less man 500 miUiseconds can 

by making the IS and the RS independent of the rate of the be acn j eve d 

TSs. As an example, the IS can be a "101010 ..." pattern. A1 _ ' . . . .« « .„ . ^ 

The RS, on the other hand, will carry the originating node ID 35 ^hough various embodiments are specifically illustrated 

and the terminating node ID. Therefore, the signal can be ™* des ? lbed nerem it will be appreciated that modifica- 

generated as a framing pattern appended with two ID u and vanaUons of f e ^ { 1DVemi0D ^eredby 

numbers of the originating and terminating nodes. the above teachings and within the purview of the appended 

o -i, * * i_i f j- r i * *• * - claims without departing from the spirit and intended scope 

FIG. 8 illustrates a block diagram of a detecting circuit in - . . c i nu u *u *u j a 

. ... . j . n • • r- i. 40 of the invention. For example, although the method and 

accordance with one embodiment of the invention. Each . , -uju i-iuj l*l 

r t j i * * • - 4 u i oa ii apparatus described above uses a link based approach, the 

input port for the detecting circuit has a splitter 84, a clock rr t , « , . , ,. c J( i • »l 

• o/v « j * . . ™ * * * method and apparatus can be modified to apply in the case 

recovery errant 80 and a decision circuit 82. The output for Qf based ^ estoraljon as weU and wjthin lhe 

fabric°30 CirCm "* 56 ^ scope of 11,6 invention - In another e * am P le > ,he 

. _ , , « various embodiments of the invention utilize optical cross- 

As shown in FIG. 8, each signal is fed to a clock recovery conQects it can be appreciated that electrical cross-connects 

circuit 80 and a decision circuit 82. The change in the state M ^in the scope of the invention as well. 

of the signal type is decided by decision circuits 82 and a . 

control signal is generated to control the switching state of - 5 M ? lS * - ... 

the switch fabric. The IS and RS bit rates should be identical cn ***** for communications ,n a network 

so that the decision circuit can recognize the state change 50 ^ network having a plurality of nodes, with each pa* of 

from the IS to RS quickly. Since the TS and RS may be of nodes connected by a link with each lmk having informa- 

different bit rates, the timing recovery circuit may take tlon <* annels and dedicated restoration channels, compns- 

longer to recover clock and hence to recognize the state m & ste P s 

change from TS to IS. However, this state change recogni- 5S sending an idle signal on each dedicated restoration 

tion need not be fast. channel for each link; 

The restoration time depends on the number of channels detecting a failure of an intermediate node between an 

to be restored, number of restoration channels available in originating node and a terminating node, said interme- 

the alternate routes, the link lengths, and to a less extent on diate node switching information signals carried by at 

the bit rate of the RS signal. An estimate for the restoration 6Q i ea st one information channel for a plurality of links 

time is given by the following equation: terminating at said intermediate node and carrying 

information signals from said originating node to said 

T r =-iN + 3) + ,,N[N + 5) + Ni w terminating node; 

c determining an alternate path through the network around 

65 said failed node using restoration and idle signals sent 

where, N>=Ceiling [f/(l-f)] f the number of attempts required over dedicated restoration channels for links not ter- 

to restore all channels in a failed link, f-Fraction of the minating at said failed node; and 
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routing said information signals from said originating 
node to said terminating node in accordance with said 
alternate path, 

wherein said step of determining comprises the steps of: 
identifying information channels for nodes having links 
terminating at said failed node; 



14 

ranking said nodes using a connection map; 

sequentially restoring said information channels for 
nodes having links terminating at said failed node 
according to said rankings until all said information 
channels are restored. 

***** 
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